PyDigger - unearthing stuff about Python

Found 2 out of 204,120. Showing 2 on page 1. Total pages: 1.

Name	Version	Summary	date
shtec-rlhf	0.0.2.dev0	shtec-rlhf: Safe Reinforcement Learning from Human Feedback	2024-04-19 03:10:53
trl	0.8.4	Train transformer language models with reinforcement learning.	2024-04-17 15:16:50

Found 2 out of 204,120. Showing 2 on page 1. Total pages: 1.